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Search for a heavy top i! => Wq in top events 

D. Cox, on behalf of the CDF Collaboration 
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We present a search for a massive quark (£') decaying to Wq and thus mimicking the top quark decay signature 
in data collected by the CDF II detector corresponding to 2.8 fb — 1 . We use the reconstructed mass of the 
t' quark and the scalar sum of the transverse energies in the event to discriminate possible new physics from 
Standard Model processes, and set limits on a standard 4th generation tl quark. 



1. Introduction 

In this study we investigate whether present data 
from the CDF detector allow or preclude the produc- 
tion of a hypothetical new quark which decays to a fi- 
nal state with a high-px lepton, large $t, and multiple 
hadronic jets having large total transverse energy Et 
thus mimicking top quark pair event signatures. We 
refer to the hypothetical new quark as t' for brevity, 
although such a signature could be a standard fourth- 
generation up- type heavy quark or any up- type quark. 
A number of theoretical models advocate a fourth 
chiral generation of massive fermions with the same 
quantum numbers as ordinary ones [H, 0, [E El IH • Pre- 
cision measurements from LEP exclude a light fourth 
neutrino v± with mass 771(1/4) < mz/2, where mz 
is the mass of the Z boson. A fourth generation 
neutrino cannot be too heavy due to sizeable radia- 
tive corections. Despite this, reasonable constructions 
of a fourth generation exist that remain viable 
In many theoretical models the addition of a fourth 
generation relaxes present bounds on the Higgs. In 
addition frequently a small mass splitting between 
new heavy quarks t! and b' is preferred, such that 
m(b')+m(W) > m{t') and t' decays predominantly to 
Wq (a W boson and a down- type quark q — d,s,b) 
For the purposes of this analysis we assume that the 
new quark is pair-produced strongly, has mass greater 
than the top quark, and decays promptly to Wq final 
states. The data we use in the analysis corresponds 
to 2.8 fb _1 of integrated luminosity collected at the 
Fermilab Tevatron in pp collisions at y/s = 1.96 TeV. 



2. Event Selection 

The CDF II detector is described in detail in Ref. 
Q. We can parameterize coordinates in the detector 
using the azimuthal angle <p and the pseudorapidity 
rj = — ln[tan(#/2)], where is the polar angle mea- 
sured from the proton beam direction. The transverse 
energy Et is defined as EsinO, where E is the energy 
deposited in a calorimeter cluster. The transverse mo- 
mentum pt of a track is the component of the track 
momentum transverse to the beam-line. The missing 
transeverse energy ]pT is the magnitude of the vector 



defined as -T,iE T h T , where n T is the transverse com- 
ponent of the unit vector pointing from the interaction 
point to the calorimeter tower i. This is corected for 
the pt of muons, which do not deposit all of their 
energy in the calorimeter, and tracks which point to 
uninstrumented regions in the calorimeter. 

Highly energetic quarks (such as the down-type 
quark produced in the hypothesized t' decay) undergo 
fragmentation that results in jets of hadronic particles. 
Jet candidates are reconstructed using the calorimeter 
towers with corrections to improve the accuracy of the 
energy estimation and are required to have fx > 15 
GeV and |r?| < 2.5. 

Candidate events for this t' search are required to 
have exactly one isolated electron or muon with pt > 
20 GeV/c, $ T > 35 GeV, at least four jets with E T > 
20 GeV. 

To reduce the contribution of the QCD background 
we also require a lead jet with Et > 60 GeV and two 
cuts one in the Acj> between the corrected $t and the 
lepton vs JfjT plane (requiring Acj) > A\ — (l/B\)]pT 
where A\ = 4.408; B x = 6.11) and one in the A0 
between If^T and lead jet vs J^t plane (requiring A</> > 
A 2 - {\/B 2 )$t where A 2 = 1.888; B 2 = 21.6). The 
dominant contributing backgrounds after this event 
selection are from electroweak process as well as ti 
pair production. Electroweak processes are dominated 
by W + jets. For the ti background we use Monte 
Carlo (MC) generated using an assumed top quark 
mass of 175 GeV. The QCD background is modeled 
using a sample of data where the lepton ID cuts have 
been reversed. Other backgrounds (including Z+jets, 
VFW^+jets, WZ+jets and single top events) have a 
smaller rate than W + jets and in addition have been 
found to have similar kinematic distributions to W + 
jes and so are modeled as one background using the 
W + jets model. 



3. Search Technique 

We utilize the fact that in our regime of interest the 
t' decay chain is identical to that of the top quark to 
reconstruct the t' mass in the same way as is done in 
the top quark mass measurement analyses. We use the 
template method for top quark mass reconstruction Q 
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based on the best x 2 -fit to the kinematic properties of 



the final top (or t') decay products, 
by the following expression: 



The expected number of events in each bin, is 
The x 2 is given given by the sum over all sources, indexed by j 
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where the invariant masses of the W decay products 
nijj and are constrained to the pole mass of the 
W boson, and the masses of top and anti-top (t' and 
t') quarks are required to be same. The jet and lepton 
energies as well as the unclustered energy (UE) are 
allowed to float within their resolution uncertainties. 
The transverse component of the neutrino momentum 
is determined as the negative sum of the lepton, jet 
and unclustered transverse energies: 



Pt 



For each event there are total 4!/2 = 12 combi- 
nations of assigning 4 jets to partons. In addition, 
there are two solutions to account for the unknown 
z-component of the neutrino momentum. After min- 
imization of the x 2 expression, the combination with 
the lowest x 2 is selected and the value of m t is de- 
clared to be the reconstructed mass M reco of top (or 
t>). 

We use the observed distributions of the M reco and 
total transverse energy in the event 



Ht — ^ ^ Et 

jets 



Ei 



(3) 



to distinguish the t' signal from the backgrounds by 
fitting it to a combination oft' signal, top, electroweak 
background, and QCD background shapes. 

We use a binned in Ht and M reco likelihood fit to 
extract the t' signal and/or set an upper limit on its 
production rate. We chose to use bins of 25 GeV in 
both Ht and M r eco with Ht in 26 bins from 150 
to 800 GeV and M r eco in 16 bins from 100 to 500 
GeV. The likelihood is defined as the product of the 
Poisson probabilities for observing rii events in 2D bin 
i of (H T ,M reco ); 



C(o- t > 



l[P(r 



7 , LjQ-jCij 



Here Lj is the integrated luminosity, Oj is the cross 
section, and is the efficiency per bin of (Ht, M reco ). 

We calculate the likelihood as a function of the t' 
cross section, and use Bayes' Theorem to convert it 
into a posterior density in cr t /. We can then use this 
posterior density to set an upper limit on (or if we get 
lucky, measure) the production rate of t' . 

The production rate for VF+jets is a free parameter 
in the fit. Other parameters are related to systematic 
errors and treated in the likelihood as nuisance pa- 
rameters constrained within their expected (normal) 
distributions. 

We adopt the profiling method [l(| for dealing with 
these parameters, i.e. the likelihood is maximized 
with respect to the nuisance parameters. Taking this 
into account the likelihood takes the following expres- 
sion: 

C(a t '\ni) = Y[P(ni\^i) x G(v k \^k, <7u h ) 

where Vk are the nuisance parameters, such as o~ t i, 
(2) Lj and etc. % are their central nominal values and 
o~ Vk are their uncertainties. G is a Gaussian function 
centered at Vk of width er„, . 



4. Systematic Errors 

The sensitivity to t' depends on knowing accurately 
the distribution of (HT,M reco ) in data. We consider 
several sources of uncertainty including the jet energy 
scale, W^+jets Q 2 Scale, initial and final state radi- 
ation (ISR and FSR), Parton Distrubiton Function 
(PDF) uncertaintity and others. 



4.1. Jet Energy Scale 

The sensitivity to t' depends on knowing accurately 
the distribution of (Hx,M reco ) in data. One of the 
largest sources of uncertainty comes from a factor that 
has a large effect on the shape of the kinematic distri- 
bution, the jet energy scale. Jets in the data and MC 
are corrected for various effects as described in (Tl| . 
leaving some residual uncertainty. 

This uncertainty results in possible shifts in the 
Ht and M reco distributions for both new physics and 
standard model templates. We take this effect into ac- 
count by generating templates with energies of all jets 
shifted upwards by one standard deviation (+1 tem- 
plates) and downwards (-1 templates) respectively. 
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We then use a template morphing technique that 
was developed in 2005 for a previous version of this 
analysis. We interpolate and extrapolate the expecta- 
tion value [ii at each bin i as follows: 



M0,i + vjes ■ (p+i,i - M-l,i)/2 



(4) 



where /io,i is the nominal expectation value, 
and are the expectation values from (-1) and 

(+1) templates respectively,andz/jes is the nuisance 
parameter representing the relative shift in jet energy 
scale: 



VJES 



^.JES 
VJES 



(5) 



It enters the likelihood (TJ| as a Gaussian constraint 



penalty term: G(vjes |0, 1) 



? /2 



4.2. Tv+jets Q 2 Scale 

The effect of the choice of the appropriate Q 2 scale 
for W+jets production is evaluated by measuring the 
resulting change in the measured t' cross section given 
that t' exists. The Q 2 scale is varied to 2Qn 0mina i and 
nominal ^ ne expected change in the measured cross 
section is then interpreted as the uncertainty on the 
t' cross section itself. We measure this shift as a func- 
tion of the t' cross-section by drawing pseudoexperi- 
ments from shifted templates and fitting them to the 
nominal distribution. The resulting shift is fitted to a 
linear function of that t' cross-section and is incorpo- 
rated into the likelihood as an additive parameter to 
the t' cross section, so that the t' contribution to the 
expectation value fii ^ in bin i becomes 



MM' — Lf{af + Vq2)eij> 



(G) 



where vqi is constrained by a gaussian with a width, 
that is a half of the largest of the upwards or down- 
wards shifts for each mass of the t'. 

The Q 2 systematic uncertainty for the different t' 
masses are shown in Table HI 

4.3. ISR and FSR 

We varied the amount of initial- and final-state ra- 
diation together, i.e. shifting both up or both down. 
We generated samples with more ISR and FSR as well 
as some with less ISR and FSR. We refer to these 
samples as IFSR more and IFSR less. We generated 
samples for t' with masses of 250, 300 and 350 GeV 
which brackets the region where we expect to be able 
to place our exclusion limit. 

The resulting effect is treated in a similar way to the 
Q 2 systematic. Templates are made for each of these 
mass points. Pseudoexperiments are thrown with the 
shifted top and t' IFSR samples, where the shift is 



m(t') 
(GeV) 


Systematic Uncertaintity 
(Pb) 


180 


0.065 


200 


0.044 


220 


0.021 


240 


0.011 


260 


0.013 


280 


0.009 


300 


0.007 


320 


0.005 


340 


0.006 


360 


0.005 


380 


0.003 


400 


0.003 


450 


0.003 


500 


0.002 



Table I Q 2 Systematic error for each tl mass point. 



set to be the same for top and V '. We then fit the 
obtained cross-section shift using a linear function of 
the t' cross-section. 

The resulting shifts are shown in Table |TTJ We add 
the resulting shifts in quadrature with the Q 2 error in 
the likelihood. 



m(t') 




IFSR 


(GeV) 


offset slope 


180 





125 


0.026 


200 





125 


0.024 


220 





125 


0.022 


240 





110 


0.020 


260 





080 


0.018 


280 





060 


0.017 


300 





035 


0.014 


320 





025 


0.011 


340 





015 


0.009 


360 





010 


0.008 


380 





007 


0.007 


400 





005 


0.006 


450 





004 


0.005 


500 





003 


0.004 



Table II Columns 2+3: Shift (in picobarns) in apparent 
t' cross section due to a shift in the initial- and final-state 
radiation up or down 



4.4. QCD Background 

The QCD background shape is modeled from a sam- 
ple of data in which the electron cuts have been re- 
versed. The QCD normalization is obtained by fitting 
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the background (electro-weak, top, and QCD) distrub- 
tions to the data with the missing Et cut removed 
and then computing how much remains after all cuts 
are applied. As most of the QCD is expected to be 
found at low missing Et . 

We investigated using leptons that fail the isolation 
requirements to model our QCD background. This 
gave an excellent description of the jet Et spectra 
but a very poor description of the lepton pt distri- 
butions, which were much steeper in this model than 
expected from the data. On the other hand the sam- 
ple requiring the opposite of the electron cuts seems to 
give a reasonable description of most of the data kine- 
matic distributions but has only very limited statis- 
tics; this means that the QCD templates need to be 
scaled up. Cutting very hard on the leading jet Et 
removes most of the QCD background which makes 
our fit rather insensitive to the QCD modeling. The 
relative normalization uncertainty is taken to be 50%, 
due to our lack of confidence in our model and nor- 
malization method. With our QCD veto cuts it turns 
out to change the fit by a negligible amount whether 
we constrain QCD or let it float. The uncertainty is 
represented by a Gaussian-constrained parameter in 
the likelihood. The QCD background has a negligible 
effect on the t' limit. 

4.5. Integrated Luminosity 

The integrated luminosity uncertainty is taken to be 
5.9%, and is represented by an additional gaussian- 
constrained parameter multiplying all contributions 
except for the QCD background, which is normalized 
from data. 

4.6. Lepton ID 

We have two components for lepton ID. First is the 
efficiencies for the individual electrons and muons. We 
multiply each lepton type by the associated efficiency 
and gaussian constrain it within the error on the effi- 
ciency. 

Second is the uncertainty on the lepton ID efficiency 
data/MC scale factor, which is of 2%, and taken as 
correlated across lepton types since it is due to the 
presences of multiple jets in an event. We add it in 
quadrature with the luminosity error, which is also 
correlated across lepton types, and include it with a 
gaussian constraint into the likelihood. 

4.7. PDF Uncertainty 

The PDFs are not precisely known, and this un- 
certainty leads to a corresponding uncertainty in the 
predicted cross sections, as well as the acceptance. 

This effect is evaluated on both the top and the 
t' MC samples. The method consists in re-weighting 



the existing MC samples by the relative PDF weights 
given the parton momentum fractions (xi, X2) and Q 2 
of the generated interaction. 

46 eigenvectors are considered. We look at the dif- 
ference between pairs of the CTEQ6M PDFs and add 
up these in quadrature. We then consider the differ- 
ence between the two MRST72 and CTEQ5L PDF 
sets. If this is smaller than the 20 PDF sets uncer- 
tainty, we drop it. If it is larger, we add it in quadra- 
ture. To investigate the effect of a s we look at the 
difference between the MRST72 and MRST75 PDF 
sets and add this in quadrature to the above errors. 

The final PDF unccrtaintes arc given for each t' 
mass point as well as for top in Table IIII1 A common 
conservative systematic error is added in quadrature 
to all other multiplicative factors and it taken as 1.1% 
for all templates. 



top 

175 1 +0.0110 -0.0112 
tprimc 



180 


+0 


007 


-0 


008 


200 


+0 


004 


-0 


005 


220 


+0 


005 


-0 


005 


240 


+0 


003 


-0 


003 


260 


+0 


003 


-0 


003 


280 


+0 


002 


-0 


003 


300 


+0 


001 


-0 


003 


320 


+0 


001 


-0 


002 


340 


+0 


002 


-0 


002 


360 


+0 


003 


-0 


002 


380 


+0 


002 


-0 


002 


400 


+0 


005 


-0 


002 


450 


+0 


004 


-0 


005 


500 


+0 


015 


-0 


013 



Table III PDF uncertainty on top and t! calculated by 
reweighting the events according to the probability (given 
the various PDFs) of finding an up and down quark with 
appropriate momentum fractions. 



4.8. Theory Uncertainty 

The theory uncertainty in the t' cross section is 
about 10% (see Table IIV|) . which is mainly due to 
uncertainty in PDFs (~ 7%). The other effect comes 
from uncertainty in the choice of the Q 2 scale [l2| • 

We take the theory uncertainty in tt cross section 
fully correlated with the one of t'i! , and introduce 
it into the likelihood as a single nuisance parameter: 

^theory — ^theory (j^t) • 
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5. Results and Conclusion 

We tested the sensitivity of our method by draw- 
ing pseudoexperiments from standard model distribu- 
tions, i.e. assuming no t' contribution. The ranges of 
the expected 95% CL upper limits with one and two 
standard deviation bandwidth are shown in Figure [T] 
The purple curve is the theoretical prediction [12l . I IL 31 ] , 




Figure 1: Upper limit, at 95% CL, on the production rate 
for t 1 as a function of t' mass (red). The purple curve is a 
theoretical cross section. The dark blue band is the range 
of expected 95% CL upper limits within one standard de- 
viation. The light blue band represents two standard de- 
viations 

the values of which are given in Table IIVI The lower 
o~ m in and upper o~ ma x limits are obtained using the 
CTEQ6M family of parton density functions with un- 
certainties, together with the study of the scale uncer- 
tainty [lij]. From Figure [T] it follows that given no t' 



m(t') (GeV) 


Omin (pb) 


^center (pb) 


CTmax (pb) 


180.0 


4.9938 


5.7476 


6.2396 


200.0 


2.7815 


3.1898 


3.4525 


220.0 


1.5926 


1.8236 


1.9710 


240.0 


0.9299 


1.0647 


1.1515 


260.0 


0.5499 


0.6302 


0.6828 


280.0 


0.3281 


0.3769 


0.4096 


300.0 


0.1968 


0.2268 


0.2475 


320.0 


0.1183 


0.1370 


0.1502 


340.0 


0.0711 


0.0828 


0.0914 


360.0 


0.0426 


0.0500 


0.0555 


380.0 


0.0255 


0.0301 


0.0337 


400.0 


0.0152 


0.0181 


0.0204 



Table IV Theory values of t! cross section for given 
mass [H, GJ| . 

presence, this method is on average sensitive to setting 



an upper limit at 311 GeV t' mass. The red curve in 
Figure Q] shows the final result expressed as a 95% CL 
upper limit on the t' production rate as a function of t' 
mass. Table IVl shows the individual calculated limits 
along with expected limits from pseudo-experiments 
for reference. 

Based on these results we exclude at 95% CL a t' 
quark with a mass below 311 GeV, given that the true 
top mass is 175 GeV. Of course, our measurement of 
the top mass may have been affected by the presence 
of a higher mass t' and thus we should treate these 
conclucsions with care. 

The 2D-distriubtion of (Ht, M reco ) is shown in Fig- 
ure [3] Distributions of Ht and M reco showing the 
result of the fit for m(t' ')=300 GeV are shown in Fig- 
ured] 

To determine if the data show any evidence of an 
excess in the tails of Ht and M reco , we decided a 
priori to count the number of events in groups of 
n x n of our standard 25 GeV bins in these quant- 
ties, and compare with the number predicted from a 
zero-signal fit to the full two dimensional spectrum. 
For each nx n bin one can then calculate the p-value 
for having observed that number or greater, given the 
prediction. If a significant effect is observed, one can 
calculate an overall p-value which is the probability 
that one would observe a p-value at least as signifi- 
cant as the most significant n x n bin or greater; this 
takes into account both the trials factor and the effect 
of systematic errors. Table IVTl shows the result of this 
counting experiment. The most significant n x n bin 
is for n = 10; the probability for observing 29 or more 
events given 18.03 expected is 0.01. (This assumes 
systematic uncertainty on the background.) Thus we 
conclude that there is no statistically significant excess 
in the far tails of Ht and M reco . 



m(t') (GeV) 


expected limit (pb) 


observed limit (pb) 


180 


2 


954 


+ 
-0 


818 
592 


3 


759 


200 


1 


959 


+0 
-0 


869 
525 


1 


595 


220 





693 


+0 
-0 


309 
207 





355 


240 





406 


+0 
-0 


152 
100 





258 


260 





288 


+0 
-0 


102 

072 





254 


280 





208 


+0 
-0 


072 
047 





237 


300 





150 


+0 
-0 


052 
034 





188 


320 





112 


+0 
-0 


044 
031 





165 


340 





088 


+0 
-0 


032 
024 





133 


360 





070 


+0 
-0 


026 
020 





112 


380 





056 


+0 
-0 


021 
016 





109 


400 





045 


+0 
-0 


017 
013 





081 


450 





033 


+0 
-0 


012 

008 





083 


500 





028 


+0 

-0 


Oil 

006 





073 



Table V Expected and obtained limits on t! production 
cross section for given mass. 
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Figure 2: Ht (left & M reco (right) distriubtions showing the results of the fit for m(t') — 300 GeV. The normalizations 
of the various sources and distortions of the kinematic distributions due to systematic effects are those corresponding to 
the maximum likelihood when the cross section for t' is set to its 95% CL upper limit. 



800 



Search for t' (2.8 fb- 1 ) CDF Run 2 Preliminary 




300 400 
M reco (GeV) 



Figure 3: 2D distriubtion of Ht vs M reco distribution 
showing hte data (black points) and the fitted number of 
background events: QCD (dark cyan circles), W + jets 
(green squares), and ti (blue triangles). 
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Mm Mrec 


Min H T 


observed 


expected p-value 




[GeV/c 2 ] 
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1 


475 
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1.000 


2 


450 


750 





0.116 


1.000 


3 


425 


725 


1 


0.228 


0.2040 


4 
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700 
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5 
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3 
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10 
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11 
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200 
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13 


175 


475 


92 
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14 


150 


450 
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15 


125 


425 


222 


231.0 


0.7318 
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a zero-signal fit to the full spectrum. For each value of n 
the table shows the p-value, the probability for observing 
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number expected. The minimum Ht and M reco values in 
each trial are also shown. 



[9] A. Abulencia et al. (CDF Collaboration), Phys. 
Rev. D 73, 032003 (2006) arXiv:hep-ex/051 0048| . 
[10] J. Conway et al, CDF Public note 7113. 
[11] A. Bhatti et al, Submitte d to Nucl. Instr. Meth. 

A [arXiv:hep-ex/0510047| . 
[12] M. Cacciari, S. Frixione, M. L. Mangano, P. Na- 
son and G. Ridolfi, J HEP 0404 (2004) 068 
|arXiv:hep-ph/0303085] . 
[13] R. Bonciani, S. Catani, M. L. Mangano and 
P. Nason, Nucl. Phys. B 529 (1998) 424 



[arXiv:hep -ph/980i375|. 



Proceedings of the DPF-2009 Conference, Detroit, MI, July 27-31, 2009 



[14] Personal communication with Michelangelo 
Mangano. 



